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Abstract. We analyse a biased random walk on a ID lattice with unequal step 
lengths. Such a walk was recently shown to undergo a phase transition from a state 
containing a single connected cluster of visited sites to one with several clusters of 
visited sites (fragments) separated by unvisited sites at a critical probability p c [PRL 
99, 180602 (2007)}. The behaviour of p(l), the probability of formation of fragments 
of length I is analysed. An exact expression for the generating function of p(l) at 
the critical point is derived. We prove that the asymptotic behaviour is of the form 
p(Z) ~ 3/[Z(log in 
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1. Introduction 

Anteneodo and Morgado recently discussed an interesting one-dimensional random walk 
model that exhibits a phase transition [lj. In this model, at each time step the random 
walker moves two lattice spacings to the right or one spacing to the left with probabilities 
p and q respectively. The sites visited by such a walk are, depending on p, either part 
of a single connected cluster of visited sites or many clusters of visited sites (fragments) 
separated by single unvisited sites. A transition from one class to the other takes place 
at the critical probability p c = 1/3. At large times, for p < p c the cluster of visited 
sites contains no gaps, however, for p > p c this cluster contains a finite density of 
unvisited sites. Using Monte-Carlo simulation data, a power law dependence with an 
exponent ~ 1.15 was estimated for n(l), the fraction of fragments with length I. Since 
the model is a simple one dimensional random walk that is not known to contain any 
such anamolous exponents, such a behaviour seems unlikely. We prove below that the 
correct asymptotic behaviour of n(l) at the critical point is n{l) ~ [/(log Z) 2 ] -1 which 
is infact hard to distinguish numerically from a power law dependence with exponent 
~ 1.15. 

At large times, it is easy to see (using the central limit theorem) that the probability 
distribution of the position of the walker is a Gaussian with mean p(t) = (2p — q)t and 
variance a 2 {t) = 9pqt. When the walker is biased towards the left (p < 1/3), the 
probability of the walker being on the left of the origin tends to 1 as erf(\/t). Hence 
the visited sites are part of a single connected cluster. When the walker is biased towards 
the right, in general the clusters of visited sites are separated by single unvisited sites. 
The fraction of visited sites f v is defined as S n /L n where S n is the average number 
of distinct sites visited in an n-step walk and L n is the average length of an n-step 
walk. The fraction of unvisited sites f u , is given by 1 — f v . For this case, f u can be 
calculated exactly using methods outlined in [3], pp. In the large n limit, as A — > + , 
f u = A + 0(A 2 ), where A = {p — p c )- The total number of fragments in a walk is equal 
to the number of unvisited sites and hence N, the average number of fragments is equal 
to L n A. 

The single unvisited sites at the edge of each fragment ensure that the probability 
of formation of a fragment is independent of the previous history of the walker. The 
walk can therefore be considered as a discrete process of independent increments where 
at each step a new fragment of length I is added to the positive edge of the walk with 
probability p(l). These probabilities are multiplicative, i.e.- the probability of formation 
of a fragment of length l\ followed by one of length I2 is p(h)p(l2)- The number of 
fragments of length I in a given walk is Np(l) where N is the total number of fragments. 
Therefore n(l) = p(l)/ E;=iP(0 and hence n{l) = p(l) since E/=i/>(0 = 1- 
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2. Determination of fragment formation probability 

For a new fragment of length I to be formed the walker must (i) be at the positive 
edge of the walk ( at site —1 for convenience), (ii) hop over site and reach site I by 
visiting each site between 1 and /, without visiting sites and I + 1. Thus at the end 
of each step the walker is once again at the positive edge of the walk. p(l) is therefore 
the probability of event (ii) occuring. Hence p(l) is the sum of probabilities of all paths 
that are consistent with (ii). 

We calculate q{l), the probability of condition (ii) with the constraint of 'visiting 
every site between 1 and /' relaxed. Hence q{l) is the probability of starting at site —1 and 
reaching site I without visiting and l + l. q(l) thus includes the probability of formation 
of smaller fragments within the segment [1,/]. For example g(3) = p(3) + p(l)p(l). In 
general 

q(l) = p(l) + P(xi)p{x 2 ) + p{xi)p{x 2 )p(x 3 ) + ... (1) 

This translates to the following equation involving generating functions 

/ \ P~(x) . , x q(x) 

q(x) = - HK ' r , i.e.- p (x) = - K \ , 2 
K ' 1 -xp(x) ' K ' l + xq(x) K ' 

where p(x) = E/=i p(l)x l and q(x) = E/=i q(lW- 



2.1. Exact expression for q{l) 

We calculate q(l) by summing over probabilities of all paths that are consistent with 
the above definition of q(l). For example, g(l) simply involves the walker reaching site 1 
from —1 by hopping over site 0. Thus g(l) = p. It is also easy to see that q(2) is since 
unvisited sites in this walk cannot be separated by two lattice sites. In the calculation of 
q(3) we sum over paths in which the walker starting from —1, jumps over site to reach 
1 and then over 2 to reach site 3 with probability p 2 . Once at site 3, the walker can 
move two leftward steps followed by a rightward step with probability pq 2 , any number 
of times, to return to site 3. Thus q(3) = p 2 {l + pq 2 + (pq 2 ) 2 + •••) = p 2 /(l — pq 2 )- 
Generalizing this procedure of summing over all relevant walks, we derive below a closed 
form expression of q(l) in two equivalent ways. 

The walk can be considered as a Markov process where at each time step the walker 
moves either two lattice spacings to the right or one to the left with probabilities p and 
q respectively, independent of the previous step. The probability q(l) can be calculated 
from Pi, the matrix of transition probabilities [2] with no transitions from the sites 0, 
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I + 1 and 1 + 2. For example 

q 
q 
q 
p 
p 
p 

Where the rows correspond to sites [0 1 2 3 4 5]. Consider the column vector \p(t)) 
where (s\p(t)), the element in row s, is the probability of the walker starting at site 1, 
being at site s at time step t. Now \p(t)) evolves as \p(t)) = P/|p(0)) which ensures that 
\p(t)) only contains probabilities of paths that never visit sites and l + l. Since the 
walker starts at site 1, (s|p(0)) = 8± iS . We calculate \P) where (s\P) is the probability 
of the walker, starting from site 1 being at site s, at any time step. \P) is thus the sum 
of |p(t)) at every time step. Hence 

\P) = (I + P l + P? + ...)\p(0)) (4) 

All walks that start at 1 and end at I (at any time step) contribute to q(l), therefore 

q(l) = p(l\{I - Pl)-^) (5) 

where (s|l) = 5i >s and (s\l) = 6i >s . Thus q{l) is the (1, 1) matrix element of p(J — Pi)^ 1 - 
The factor p is present because the walker must hop over site to reach 1 as in 



condition (ii). We thus obtain q(l) = p, q{2) = 0, q(3) = 1 _f 2 , g(4) 

so on. Subtituting these into Eq.flS]) we obtain p(l) = p, p(2) = 0, p(3) 
p 4 ^+3pV and so on . 



P 3 Q 
\~2pq 2 



and 



1— pg 2 



(l-pq 2 )(l-3pq 2 ) 



P(4) 

Alternatively, consider G(s), the probability of a walker starting at site -1, being 
at site s while never visiting and l + l. Now G(s) satisfies the difference equation 

G(s) = P G{s - 2) + qG{s + 1) (6) 

with the boundary conditions G(0) = and G(l + 1) = 0. A general solution of G(s) is 
Az{+Bzl+Czl which on substitution into Eq.flH]) yields the cubic equation qz 3 — z 2 +p = 
0. The solutions to this equation are z — l,z± where z± — (p ± \Jp 2 + Apq)/2q. By 
satisfying boundary conditions we obtain 



G{s) = A[l 



1 



J+i 



1) 



J+i 



— z 



i+i' 



j+i 



j+r-i 



(7) 



where A is a constant which is determined from the normalization condition G(—l) = 1. 
Alternatively from the definition of q(l) as the connecting probability between —1 and 
I we have 

q{l) = G{l)/G{-1) (8) 

We thus have a closed form expression for q(l). We verify that the two expressions 
for q(l) are equal as the matrix elements of the column corresponding to site 1 in 
p(I — Pi)^ 1 satisfy the same difference equation and, with an appropriate change of 
variables, the same boundary conditions as G(s). 
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Figure 1. Monte-Carlo plots of log(S'(/)) where S(l) = X)>/ n (0 f° r different values 
of A = p — p c , along with the theoretical curve calculated using Eq. © and Eq. © . 



3. Asymptotic behaviour 

Away from the critical point (p > p c ), for large / 

2 + 



z_(z + - z_)(z + - V 



1 



1 



;i - z_)V 



Exp[-//£] 



(9) 



where £ 



3(p-p c ) 



£ thus defines a correlation length for q(l) and hence for p{l). 



At the critical point q(l) simplifies to 



2[i - (-m - 3i(-iy 



+ 



i 

37 



for large I 



(10) 



Therefore 
Eq.© 



1 the generating function q(x) diverges as — | log(l — a;). Hence from 
-|log(l -x) 



|log(l 



for (x — > l,p = p c 
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Having found the generating function, we can extract the behaviour of p(/) for 
large /. From Eq. ljTTI) . ~p(x) ~ 1 + 3/loge where e = (1 — x). We can approximate 
the summation in the generating function by an integral ~p(x) ~ /^Expf— e l]p(l)dl. 
The singular part of this integral can be evaluated by equating p(x) ~ fl' e p(l)dl from 
which we see that $y e p(l)dl ~ —3/loge. Alternatively, from the fact that the integral 
Lim e ^o j2°° Ex P[- e ^]/(log /) 2 d/ diverges as [e(log e) 2 ] _1 (l + 0[^]) we obtain 

3 

~ 771 — m for lar S e / ( 12 ) 
/(log/) 2 

We thus have an expression for the asymptotic behaviour of p(l). 

Cases in which the rightward steps are of length k > 2 can be treated in a similar 
way. In this case the independent elements are clusters of visited and unvisited sites 
separated by k — 1 zeroes. 
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Figure 2. Theoretical plots of l/S(l) v/s Log(Z) for different values of A showing a 
limiting slope of ~ 0.33 in accordance with Eq. (TT2]) 
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